NSF PAR Search | NSF Public Access Repository

Integrating and Characterizing HPC Task Runtime Systems for hybrid AI-HPC workloads

https://doi.org/10.1145/3731599.3767587

Merzky, Andre; Titov, Mikhail; Turilli, Matteo; Jha, Shantenu (November 2025, ACM)

Free, publicly-accessible full text available November 15, 2026
ROSE: RADICAL Orchestrator for Surrogate Exploration

https://doi.org/10.1145/3731599.3767347

Alsaadi, Aymen; Wang, Tianle; Park, Andrew; Bajracharya, Pradeep; Wang, Linwei; Sun, Fanbo; Seal, Sudip; Jadhao, Vikram; Fox, Geoffrey; Jha, Shantenu (November 2025, ACM)

Free, publicly-accessible full text available November 15, 2026
PaleoSTeHM v1.0: a modern, scalable spatiotemporal hierarchical modeling framework for paleo-environmental data

https://doi.org/10.5194/gmd-18-2609-2025

Lin, Yucheng; Kopp, Robert E; Reedy, Alexander; Turilli, Matteo; Jha, Shantenu; Ashe, Erica L (May 2025, Geoscientific Model Development)

Geological records of past environmental change provide crucial insights into long-term climate variability, trends, non-stationarity, and nonlinear feedback mechanisms. However, reconstructing spatiotemporal fields from these records is statistically challenging due to their sparse, indirect, and noisy nature. Here, we present PaleoSTeHM, a scalable and modern framework for spatiotemporal hierarchical modeling of paleo-environmental data. This framework enables the implementation of flexible statistical models that rigorously quantify spatial and temporal variability from geological data while clearly distinguishing measurement and inferential uncertainty from process variability. We illustrate its application by reconstructing temporal and spatiotemporal paleo-sea-level changes across multiple locations. Using various modeling and analysis choices, PaleoSTeHM demonstrates the impact of different methods on inference results and computational efficiency. Our results highlight the critical role of model selection in addressing specific paleo-environmental questions, showcasing the PaleoSTeHM framework's potential to enhance the robustness and transparency of paleo-environmental reconstructions.
Free, publicly-accessible full text available May 14, 2026
xGFabric: Coupling Sensor Networks and HPC Facilities with Private 5G Wireless Networks for Real-Time Digital Agriculture

https://doi.org/10.1145/3731599.3767589

Kurafeeva, Liubov; Subedi, Alan; Hartung, Ryan; Fay, Michael; Biswas, Avhishek; Jha, Shantenu; Kilic, Ozgur; Krintz, Chandra; Merzky, Andre; Thain, Douglas; et al (November 2025, ACM)

Free, publicly-accessible full text available November 15, 2026
Pareto Prompt Optimization

Zhao, Guang; Yoon, Byung-Jun; Park, Gilchan; Jha, Shantenu; Yoo, Shinjae; Qian, Xiaoning (April 2025, 13th International Conference on Learning Representations (ICLR 2025))

Free, publicly-accessible full text available April 25, 2026
Pareto Prompt Optimization

Zhao, Guang; Yoon, Byung-Jun; Park, Gilchan; Jha, Shantenu; Yoo, Shinjae; Qian, Xiaoning (April 2025, The Monticello gazette)

Free, publicly-accessible full text available April 25, 2026
Deep RC: A Scalable Data Engineering and Deep Learning Pipeline

Sarker, Arup; Alsaadi, Aymen; Halpern, Alexander; Tangella, Prabhath; Titov, Mikhail; Perera, Niranda; Staylor, Mills; von Laszewski, Gregor; Jha, Shantenu; Fox, Geoffrey (June 2025, 28th edition of the workshop on Job Scheduling Strategies for Parallel Processing. JSSPP 2025 https://jsspp.org/)

Significant obstacles exist in scientific domains including genetics, climate modeling, and astronomy due to the management, preprocess, and training on complicated data for deep learning. Even while several large-scale solutions offer distributed execution environments, open-source alternatives that integrate scalable runtime tools, deep learning and data frameworks on high-performance computing platforms remain crucial for accessibility and flexibility. In this paper, we introduce Deep Radical-Cylon(RC), a heterogeneous runtime system that combines data engineering, deep learning frameworks, and workflow engines across several HPC environments, including cloud and supercomputing infrastructures. Deep RC supports heterogeneous systems with accelerators, allows the usage of communication libraries like MPI, GLOO and NCCL across multi-node setups, and facilitates parallel and distributed deep learning pipelines by utilizing Radical Pilot as a task execution framework. By attaining an end-to-end pipeline including preprocessing, model training, and postprocessing with 11 neural forecasting models (PyTorch) and hydrology models (TensorFlow) under identical resource conditions, the system reduces 3.28 and 75.9 seconds, respectively. The design of Deep RC guarantees the smooth integration of scalable data frameworks, such as Cylon, with deep learning processes, exhibiting strong performance on cloud platforms and scientific HPC systems. By offering a flexible, high-performance solution for resource-intensive applications, this method closes the gap between data preprocessing, model training, and postprocessing.
Free, publicly-accessible full text available June 7, 2026
Deep RC: A Scalable Data Engineering and Deep Learning Pipeline

Sarker, Arup; Alsaadi, Aymen; Halpern, Alexander; Tangella, Prabhath; Titov, Mikhail; Perera, Niranda; Staylor, Mills; Laszewski, Gregor von; Jha, Shantenu; Fox, Geoffrey (June 2025, Springer. JSSPP 2025: Job Scheduling Strategies for Parallel Processing)

Significant obstacles exist in scientific domains including genetics, climate modeling, and astronomy due to the management, preprocess, and training on complicated data for deep learning. Even while several large-scale solutions offer distributed execution environments, open-source alternatives that integrate scalable runtime tools, deep learning and data frameworks on high-performance computing platforms remain crucial for accessibility and flexibility. In this paper, we introduce Deep Radical-Cylon(RC), a heterogeneous runtime system that combines data engineering, deep learning frameworks, and workflow engines across several HPC environments, including cloud and supercomputing infrastructures. Deep RC supports heterogeneous systems with accelerators, allows the usage of communication libraries like MPI, GLOO and NCCL across multi-node setups, and facilitates parallel and distributed deep learning pipelines by utilizing Radical Pilot as a task execution framework. By attaining an end-to-end pipeline including preprocessing, model training, and postprocessing with 11 neural forecasting models (PyTorch) and hydrology models (TensorFlow) under identical resource conditions, the system reduces 3.28 and 75.9 seconds, respectively. The design of Deep RC guarantees the smooth integration of scalable data frameworks, such as Cylon, with deep learning processes, exhibiting strong performance on cloud platforms and scientific HPC systems. By offering a flexible, high-performance solution for resource-intensive applications, this method closes the gap between data preprocessing, model training, and postprocessing.
Free, publicly-accessible full text available June 3, 2026
Hydra: Brokering Cloud and HPC Resources to Support the Execution of Heterogeneous Workloads at Scale

Alsaadi, Aymen; Jha, Shantenu; Turilli, Matteo (September 2024, Association for Computing Machinery)

Full Text Available
Radical-Cylon: A Heterogeneous Data Pipeline for Scientific Computing

Sarker, Arup Kumar; Alsaadi, Aymen; Perera, Niranda; Staylor, Mills; von Laszewski, Gregor; Turilli, Matteo; Kilic, Ozgur O; Titov, Mikhail; Merzky, Andre; Jha, Shantenu; et al (December 2024, Springer Nature Switzerland)

Full Text Available
